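language n. 1. Language; the national language (of a people or country); tone, wording. 2. The speech, manner of speaking, or style (used by a speaker or writer). 3. Specialized vocabulary, terminology. 4. The cry (of an animal); meaning conveyed (by movements, gestures, etc.). 5. [Automation] Machine code (= machine language). 6. [Slang] Coarse language, swearing; abuse. 7. Attitude, stance. 8. [Archaic] A people; the nationals of a country.
a common language: a shared language; a dead language: a language no longer spoken; a foreign language: the language of another country; a living language: a language still in use; long language: ordinary language (as opposed to symbolic language); oral [spoken] language: the spoken language; the Chinese language: Chinese; written language: the written language; high language: exaggerated words; in his own language: in his own words; with a great command [an easy flow] of language: eloquently, with fluent speech; legal language: legal terminology; medical language: medical terminology; parliamentary language: parliamentary phrasing, polite speech; the language of diplomacy: diplomatic phrasing; the language of the science: scientific terminology; finger [gesture, sign] language: sign language; the language of flowers: flower symbolism (e.g. the lily standing for purity); the language of the eyes: communication by glances, conveying feeling with the eyes; Billingsgate language = language of the fish-market: vulgar, coarse talk; in strong language: in violent, coarse words; use (bad [foul, warm]) language to sb.: to abuse someone verbally; in fourteen languages [US slang]: extremely; speak the same language: to share a language, to hold the same beliefs and views; language arts: language arts as a school subject (in primary and secondary schools).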
Research in natural language processing in recent years has shown that the maximum entropy principle is a very effective method for building statistical language models.
In statistical language, we would say that the relative frequency becomes stable as the number of tosses becomes large (assuming the coin is tossed under uniform conditions).
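The stabilizing of the relative frequency can be seen in a small simulation. This is only an illustrative sketch: the function name `relative_frequencies`, the fair-coin probability of 0.5, the fixed seed, and the checkpoint values are all assumptions made for the example, not part of the source sentence.

```python
import random

def relative_frequencies(num_tosses, checkpoints, seed=0):
    """Toss a simulated fair coin and record the relative frequency of heads.

    Returns a dict mapping each checkpoint (a number of tosses) to the
    fraction of heads observed up to that point.
    """
    rng = random.Random(seed)  # fixed seed (assumed) so runs are repeatable
    wanted = set(checkpoints)
    heads = 0
    result = {}
    for i in range(1, num_tosses + 1):
        # Uniform conditions: every toss has the same probability of heads (0.5).
        heads += rng.random() < 0.5
        if i in wanted:
            result[i] = heads / i
    return result

if __name__ == "__main__":
    checkpoints = [10, 100, 1_000, 10_000, 100_000]
    for n, freq in relative_frequencies(100_000, checkpoints).items():
        print(f"{n:>7} tosses: relative frequency of heads = {freq:.4f}")
```

With few tosses the printed frequency can sit well away from 0.5; as the number of tosses grows it settles close to 0.5, which is the stability the sentence describes.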
To bring knowledge of language itself to bear on retrieval, statistical language models (SLMs) for information retrieval were proposed a few years ago and have developed rapidly.
An object-oriented Chinese statistical language modeling toolkit is presented. The original trigram HMM statistical model is improved to capture more long-distance dependencies, promoting the use of statistical language models in Chinese natural language processing.
Caption recognition based on statistical machine learning extracts features with the wavelet transform and combines hidden Markov model and statistical language model recognition techniques to identify caption text.
Neural networks are used more often in lossy data coding (such as audio and image coding) than in general lossless data coding, because standard neural networks must be trained offline and are too slow to be practical. This thesis examines in detail the respective characteristics of the maximum entropy statistical language model and of neural networks, and on that basis proposes an arithmetic coding method based on neural networks and the maximum entropy principle: an adaptive algorithm that can learn online and has a compact network structure.
The section language model improves on the traditional statistical language model in two respects. First, because a conceptual base can correspond to many words in a language while the words in a query are merely particular instances of it, this paper introduces a correlation vocabulary table that maintains all the possible words for each conceptual base. When the language model is constructed, it considers not only the words of the query but also all the words related to the conceptual bases in the query, which effectively improves the recall of the retrieval results.
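The role of the correlation vocabulary table can be sketched as a query-expansion step in front of a simple unigram query-likelihood scorer. This is a minimal illustration under stated assumptions, not the paper's method: the table contents, the names `CONCEPT_TABLE`, `expand_query`, and `log_likelihood`, and the add-alpha smoothing are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical correlation vocabulary table: each conceptual base maps to
# all the words that may express it.
CONCEPT_TABLE = {
    "VEHICLE": {"car", "automobile", "vehicle"},
    "COST": {"price", "cost"},
}

def expand_query(query_words, table=CONCEPT_TABLE):
    """Replace each query word with every word that shares a conceptual base with it."""
    expanded = set()
    for word in query_words:
        related = [words for words in table.values() if word in words]
        if related:
            for words in related:
                expanded |= words
        else:
            expanded.add(word)  # no known conceptual base: keep the word itself
    return sorted(expanded)

def log_likelihood(words, doc_tokens, vocab_size, alpha=1.0):
    """Log-probability of `words` under an add-alpha smoothed unigram model of one document."""
    counts = Counter(doc_tokens)
    total = len(doc_tokens)
    return sum(
        math.log((counts[w] + alpha) / (total + alpha * vocab_size)) for w in words
    )

if __name__ == "__main__":
    doc_a = "the automobile price is low".split()
    doc_b = "the weather is nice today".split()
    vocab_size = len(set(doc_a) | set(doc_b))
    query = ["car", "cost"]
    for name, q in [("original", query), ("expanded", expand_query(query))]:
        score_a = log_likelihood(q, doc_a, vocab_size)
        score_b = log_likelihood(q, doc_b, vocab_size)
        print(f"{name}: doc_a={score_a:.3f} doc_b={score_b:.3f}")
```

In this toy setup neither document contains the literal query words, so the original query cannot tell them apart; after expansion, the document that mentions "automobile" and "price" scores higher, which is the recall effect the passage describes.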
Given the characteristics of spoken Mandarin, the recognizer adopts an HMM-based acoustic model and a mixed statistical language model based on both words and word classes, together with a word-tree Viterbi beam search guided by the language model. It also uses a syllable-based filler model, utterance verification, and rejection to handle out-of-vocabulary words, non-speech sounds, and noise.